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Abstract  -A  multi-decoder  design  far  PLA  devices  is  intro¬ 
duced  end  found  to  be  superior  to  both  two  decoder  ROM  and 
single  decoder  PLA  devices  in  implementing  a  special  class  of 
Boolean  expressions.  In  this  class  the  logic  expressions  may 
be  lengthy  but  are  restricted  in  the  number  of  input  variables 
comprising  each  p-term.  A  theoretical  analysis  of  the  area 
efficiency  of  the  new  design  is  supplemented  by  CAD  design 
examples  which  verify  its  superiority.  Implementation  of  the 
multi-decoder  design  using  three  dimensional  microcircuit 
topography  to  attain  even  greater  savings  in  area  and  jpeed  is 
considered  in  the  conclusion  of  this  paper. 

I.  INTRODUCTION 

A  ROM  device  with  two  decoders,  n  inputs  and  l  out¬ 
puts  requires  an  area  proportional  to  the  number  of  cells 
or  crosspoints  which  total  2n2(n/z)  -  I2n  For  simplicity 
we  assume  the  two  decoders  are  identical.  The  useful 
space  for  data  storage  is  of  size  2n2<"/z,  cells.  If  a  ROM  is 
implemented  m  a  one  decoder  design,  equivalent  to  a 
single  decoder  PLA  capable  of  accommodating  the  max¬ 
imum  number  of  p-terms  (2")  given  n  total  inputs,  it 
wcuid  be  of  size  (2 n  -  l  )2n  with  an  unacceptable  decoder 
area  of  2n2".  In  addition  to  requiring  less  area,  a  two 
decoder  design  has  a  propagation  delay  proportional  to 
vT 2(n/ZI  while  a  single  decoder  ROM  has  a  delay  propor¬ 
tional  to  2*  as  an  order  of  magnitude  estimate  for  both 
cases,  since  PLA  devices  are  usually  designed  with  a  sin¬ 
gle  decoder,  the  transition  to  a  two  decoder  PLA  design 
is  justifiable  only  if  a  sufficiently  large  number  of  p- 
terms  are  required  for  the  output  expression!  s).  Nor¬ 
mally  the  decomposition  to  a  two  decoder  architecture 
requires  a  preconditioning  of  the  given  Boolean  expres¬ 
sions  that  is  likely  to  result  m  the  decision  to  utilize  a 
two  decoder  ROM  architecture. 

In  Section  II  we  consider  a  particular  class  of 
expressions  m  wmch  the  n  inputs  are  partitioned  into  s 
subsets  of  r  inputs  each  and  in  which  each  subset  can 
contribute  at  the  most  q  variables  to  each  p-term.  Each 
p-term  in  the  final  output  expression  may  then  contain 
up  to  s  x  q  input  variables.  It  is  found  that  there  are 
combinations  of  values  of  q.  n  and  s  for  which  a  two 
decoder  PLA.  capable  of  implementing  any  expression  in 
the  above  class,  is  quite  superior  to  both  a  one  decoder 
PLA  and  to  a  regular  two  decoder  ROM.  The  new  muiti- 
decoder  PLA  designs  are  ROM-like  in  the  sense  that  the 
AND  arrays  (the  decoders)  are  preprogrammed,  and  are 
PLA-Uke  in  the  sense  that  they  implement  only  a  res¬ 
tricted  class  of  expressions.  Potential  applications  where 
such  large  but  restricted  expressions  might  arise 
include  pattern  recognition,  knowledge  based  systems 


and  artificial  intelligence  The  proposed  multi-decoder 
PLA  devices  need  not  be  restricted  to  the  aforemen¬ 
tioned  cases  with  preprogrammed  decoders.  This  is  done 
in  Section  II  only  to  establish  the  need  and  utility  of  such 
devices  which  then  could  be  designed  without  prepro¬ 
gramming  the  decoders  to  supply  greater  flexibility  and 
potential  use 

The  multi-decoder  PLA  design  is  compared  to  a  ROM 
design  by  using  both  the  simplified  theoretical  analysis 
in  Section  11  as  well  as  the  more  precise  area  assessment 
made  in  Section  III  based  on  actual  CAD  layout  design 
examples  The  ROM  design  m  Section  III  is  based  on  an 
Interesting  approach  utilizing  a  dual  decoder  architec¬ 
ture  which  can  be  viewed  as  a  cascade  of  two  one 
decoder  ROMs.  This  design  consequently  lends  itself  for 
iterative  cascading  of  single  decoder  ROMs  to  produce 
multi-decoder  ROMs  and  PLAs  Also,  in  the  case  of  a  ROM. 
the  exact  allocation  of  inputs  to  each  decoder  is  decided 
upon  by  minimizing  a  desired  cost  function  such  as  total 
silicon  area  as  explained  in  Section  III  The  possibility  of 
a  multi-decoder  ROM  implemented  with  three- 
dimensional  microcircuit  topography  having  substan¬ 
tially  reduced  propagation  delay,  is  among  the  topics 
recommended  for  further  research  in  the  conclusion. 
Section  IV 

D.  A  MULTI-DECODER  DESIGN  FOR  A  CLASS  OF  PLA  DEV¬ 
ICES 

In  this  section  we  consider  the  performance  of  a 
new  class  of  multi-decoder  PLA  devices  that  implement  a 
class  of  Boolean  expressions  with  restricted  p-terms  as 
mentioned  above.  These  new  devices  are  found  to  be 
more  area  efficient  than  a  ROM  or  single  decoder  PLA 
implementation  of  s’  ch  expressions.  The  following 
definitions  are  used 

n  =  number  of  inputs 
l  -  number  of  outputs 

s  =  number  of  subsets  of  inputs,  each  of  size  r 
q  =  maximum  contribution  from  each  of  the  input 

subsets  to  each  p-term 

IF,  =  width  of  decoder  for  one  decoder  PLA  design 
fFj  =  width  of  each  decoder  for  a  multi-decoder 
PLA  design 

W3  =  width  of  each  decoder  for  a  two  decoder 
ROM  design 

D\  =  area  of  decoder  In  a  one  decoder  PLA  design 
Di  -  total  decoder  area  in  a  multi-decoder 
PLA  design 
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Z?3  =  total  decoder  area  in  a  two  decoder  ROM 
A |.  Aj  «  A3  =  total  chip  area  for  a  one  k  two  decoder 
PLA,  and  two  decoder  ROM  respectively 
Mz  k  =  total  data  storage  area  tor  above  three 
cases 

where  all  areas  are  measured  in  terms  of  number  of  cells 
or  crosspoints. 


From  (2.5).  the  following  sufficient  condition  for  A z  <  As 
is  easv  to  deduce 


2(1  +  logsr) 
and  for  s  =2  we  get 
n 


9* 


21ogn 


(215) 


We  consider  first  the  case  of  a  one  decoder  PLA 
design.  The  maximum  width  of  the  decoder,  which 
corresponds  to  generation  of  the  maximum  number  of 
p-terms  given  the  above  restriction  of  having  q  entries 
from  each  of  the  s  input  subsets,  is 
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and  its  size  is 


(21) 


D,  =  ZnW,  (2.2) 

The  data  storage  space  is  of 

Mx  =  lfYl  (2.3) 

locations  and  the  total  device  size  is 


A,  =  0,  +  Ux  =  (2n  ♦  f)*,  (2.4) 

For  an  s  decoder  PLA  design,  with  each  decoder  having 
one  of  the  s  input  subsets  as  its  input,  we  get 


W,  = 


-2* 


8  ?f(»--3)! 

The  total  size  of  the  s  decoders  is 


(25) 


Dz  -  s(2rWJ,  =  2nWj  =  2 n*v (2.8) 
and  the  data  storage  space  is  of  sue 

Uz  =  fWj*  =  i*,  =  M,  (2.7) 

The  chip  size  is  then 


Az  -  Dz+Mz  =  ZnlTz-rlfVz'  (2.3) 

The  memory  space  in  both  the  one  and  multi-decoder 
PLA's  are  of  identical  size,  but  the  size  of  the  decoders  is 
reduced  dramatically  in  the  multi-decoder  design,  as  is 
apparent  from  the  ratio. 


Dz  _  1 

f?, 


(2  9) 


from  which  we  get 

Az_=  (2n/ By1)  +  (  .  t 
<4|  2n+(  2n-rf 


(2  10) 


The  approximate  expression  above  likewise  applies  to  the 
ratio  of  the  areas  of  the  one  and  two  decoder  ROM  s  as 
mentioned  in  the  introduction,  which  illustrates  that  the 
new  multi-decoder  PLA  design  achieves  the  same  advan¬ 
tages  in  comparison  to  a  one  decoder  PLA  for  imple¬ 
menting  the  class  of  Boolean  expressions  defined  above 
In  addition  to  demonstrating  the  superiority  of  the 
multi-decoder  PLA  in  comparison  to  a  one-decoder  PLA 
design,  we  need  to  consider  the  conditions  under  which 
it  compares  favoraolv  with  a  two-decoder  ROM  for  which 
we  have. 


1 =  2"' 2 

(2  ::) 

Dz  =  2n(2"/2) 

(2  :2) 

< 

u 

II 

:2  :3) 

A 3  =  Dz  *  =  (2n  *  !2"/!)2"/J 

12  : 4 ) 

which  gets  closer  to  an  equality  as  n  increases  and  q 
becomes  smail.  As  n  increases,  the  ratio  Aj/Aj 
decreases  rapidly  The  main  source  of  the  inefficiency  of 
the  ROM  implementation  for  this  range  of  values  of  n  is 
that  it  contains  much  more  memory  space  than  is 
needed  for  the  restricted  class  of  expressions  of 
interest.  For  example,  for  s=2  and  a  given  l  and  q  .  Az 
increases  as  0(n2*)  while  A3  increases  as  0(2" )  3o  the 
area  of  a  ROM  increases  exponentially  m  rt  versus  the 
polynomial  increase  m  the  PLA  implementation 

To  illustrate  the  performance  of  the  proposed 
multi-decoder  PLAs  we  consider  the  following  examples 

Example  1:  For  n  =  ;B.  s=2.  q  =2  and  l  =  1  we  derive 
*2=144.  Az  =  2.59  £  10* 

while 

*3  =  5:2  and  A,  =  2.B:i  105 

and 

A*/Aa  =  0.092 

A  one  decoder  PLA  will  require 

*,=2.07x  ;D«  and  A(  =7.67  x  :0s 
and 

A*/A,=0  033 


Dearly  the  two  decoder  PLA  is  superior  to  both  the  ROM 
implementation  which  has  more  area  and  capability 
than  is  needed  for  the  task  at  hand,  and  to  the  one 
decoder  PLA.  which  is  too  long  for  practical  implementa¬ 
tion. 


Example  2:  Same  as  in  Example  :  but  with  l  =  10  outputs. 
In  this  case  we  get 

A,  =  2  13=  :o5 
Aj/.4,  =  o  oa: 
and 


Az/  A,  =0  22 

where  the  ROM  becomes  more  inefficient  w-hile  the  per¬ 
formance  of  the  one  decoder  PLA  improves  slightly  as 
predicted  by  Eq  (2  :0).  but  still  remains  inferior  to  the 
two  decoder  design 

Example  3.  Here  we  let  n  =24  s  =2.  q  =3  and  1  = : 

In  this  case  we  get 
Az-  3  16i  10* 

and 


Az/  Az=  0  .87  A?/  A,  =0  02 


The  same  case  with  7  =2  would  have  resulted  in 
At  -  3  24  2  ;04 

and 

At/ At-  0  0048.  At/ A]  =  0  02 

which  illustrates  the  small  range  of  values  of  7 .  Ss  is  also 
suggested  by  Eq.  (2  15)  for  which  the  new  technique  is 
superior  to  regular  two  decoder  ROM  devices 

Example  4.  For  n  =32.  s=2.  7=3  and  f  =  '.  we  obtain 
At-Z  61  x  ;0:.  At/A,  =  6  07z  :0~5 

and 

At/Ax- 0015  . 

reflecting  the  increased  t  Lciency  of  the  new  approach 
as  n  increases 


m.  CIRCUIT  DESIGN  EXAMPLES  &  ANALYSIS 

When  considering  the  class  of  Boolean  expressions 
mentioned  above  the  use  of  a  multi-decoder  PLA  with 
restricted  input  partiomng  is  the  only  single  chip  solu¬ 
tion  for  large  n.  The  use  of  a  standard  single  decoder 
PLA  to  implement  the  class  of  expressions  is  not  practi¬ 
cal  because  of  the  very  long  and  narrow  chip  that  would 
result.  A  two’ decoder  ROM  would  not  be  possible  because 
of  the  unacceptable  amount  of  silicon  area  necessitated 
by  large  n  To  demonstrate  the  implementation  of  a  two 
decoder  PLA  where  the  decoders  (AND  arrays)  are  pre¬ 
programmed.  two  single  decoder  PLAs  are  cascaded 
together  with  the  outputs  from  one  PLA  input  to  a 
second  PLA  This  approach  allows  for  several  sum  of  pro¬ 
ducts  expressions  formed  from  one  PLA  to  be  ANDed  with 
any  individual  p-term  generated  by  the  decoder  of  ‘he 
second  PLA  The  resulting  expressions,  also  in  sum  of 
products  form,  may  then  be  ORed  together  in  the  output 
OR  array  to  form  the  final  output  expression  The  basic 
architecture  on  which  the  two  decoder  PLA  design  is 
based  is  the  standard  PLA  architecture  (1.2.3)  using  the 
"AND-OR"  structure 

For  implementation  of  the  two  decoder  PLA  design. 
“NOR"  structures  are  used  for  the  AND  and  OR  arrays 
with  two-phase  nonoverlapping  clocks  (♦!.  *2)  and  array 
precharging  Fig  1  depicts  the  schematic  representa¬ 
tion  of  the  two  decoder  PLA  with  PLA1  being  the  upper 
array  and  PLA2  being  the  lower  array  A  new  input  can  be 
supplied  every  $1  period.  Given  an  input  at  time  t=Q. 
the  desired  output  is  available  at  time  t=t  -  241.  Let  us 
consider  logic  flow  through  the  PLA  starting  at  t=2. 
When  41  is  hign  the  AND  planes  are  precharged  and  the 
inputs  from  t=0  are  allowed  to  propagate  into  the  AND 
pianes  AND!  and  AND2  while  inputs  from  t=t  -  41  pro¬ 
pagate  into  AND3  As  4i  goes  low  the  inputs  from  t=0 
are  latched.  When  42  goes  high  the  product  terms  are 
evaluated  (AND  pianes).  then  propagate  into  the  OR 
pianes  that  are  being  precharged  and  are  latched  when 
42  goes  low  As  the  next  41  goes  high  the  OR  planes  are 
evaluated  by  allowing  the  value  of  the  sum  of  products 
expression  to  discharge  the  precharged  OR  line  (the  line 
may  or  may  not  be  discharged  depending  on  the  value  of 
the  exoression)  and  the  outputs  from  PLA1  and  PLA2  are 
latcned.  It  takes  one  more  period  of  4i  to  evaluate  the 
OR  2  plane  and  latch  the  output  (OUT). 

A  ’C"  program  was  written  to  automatically  layout 
the  two  decoder  PLA  design  described  above.  The  physi¬ 
cal  lavout  of  the  design  :s  baseo  on  the  scalable  MO  l 


design  rules.  Fig  2a  depicts  a  two  decoder  PLA  having 
the  functional  capaointy  of  -a  ROM.  with  16  total  inputs 
and  one  output.  The  decoders  are  fully  decoded,  each 
having  8  inputs  and  256  3-bit  p-terms.  Fig  2b  depicts  a 
two  decoder  PLA  also  having  16  total  inputs  and  one  out¬ 
put  but  with  restricted  input  partiomng  (7=2)  which 
results  in  112  2-bit  pterms  per  decoder.  Both  designs 
are  capable  of  implementing  the  class  of  expressions  of 
interest,  however  the  ROM  design  has  more  capability 
than  ls  needed  for  the  task  at  hand.  These  designs  form 
the  basis  for  the  following  comparative  analysis  between 
a  two  decoder  ROM  and  two  decoder  PLA. 

Given  the  two  designs  illustrated  in  Fig  2.  we  can 
determine  precisely  the  silicon  area  A?  required  to 
implement  a  two  decoder  PLA  (s=2)  with  restricted 
input  partitioning  as  defined  in  Section  1.  Ap  is  then  com¬ 
pared  to  the  total  silicon  area  Aj>  required  to  implement 
a  two  decoder  PLA  with  unrestricted  ROM  capability 

Adhering  to  the  scalable  MOSIS  design  rules  and  the 
particular  design  shown  in  Fig  2.  a  set  of  equations  were 
empirically  developed  to  determine  absolute  layout 
dimensions  for  any  size  two  decoder  PLA.  The  length  Lp 
and  width  Wp  of  the  layout  are  given  by 


Lp  =  C, 


n/2; 

(71/2-7)171 


2*1  +  C2{1  l  MOD  2) 


<32) 

where 

n  =  total  number  of  inputs 

7  =  number  of  variables  each  decoder 
contributes  to  a  p-term 

L  =  total  number  of  outputs 


representing  tbe  relevant  parameters  alTecting  device 
area  and  C, .  C2.  C,,  K,.  Kz  and  Kj  are  all  design  and 
technology  dependent  constants  These  constants 
represent  the  contribution  of  device  substructures  (e  g 
latches,  pullups.  various  metallizations.  OR  and  AND 
cells)  to  total  device  proportions  Eqs  (3  '.)  and  13  2) 
apply  only  to  the  particular  design  implemented  in  Fig  2 
and  may  vary  somewhat  in  form  with  even  slight  archi¬ 
tectural  alterations  Normalizing  Eqs  (3  '.)  and  (3  2)  to 
the  feature  size  A  and  inserting  values  for  constants  we 
obtain 
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where  A  may  equal  .75.  1.0  c.-  1.5  microns,  corresponding 
to  1.25.  2.0  or  3.0  micron  geometries  as  designated  by 
the  MOSB  scalable  design  rules.  The  modulus  operator 
(MOD)  present  in  Eqs.  (3.1)  and  (3.3)  simply  reflects  the 
architectural  imposition  that  an  even  number  of  output 
lines  must  be  present  in  the  layout  regardless  of  actual 
number  of  outputs,  to  maintain  structural  symmetry  in 
the  output  OR  array.  Total  silicon  area  for  the  two 
decoder  PLA  architecture.  Ap.  is  then  simply  the  product 
of  Eqs.  (3.3)  and  (3.4)  given  by 

Ap  =  Lp  WP  A*  (3.5) 

where  area  is  expressed  in  terms  of  A*.  It  shouid  be 
noted  that  the  design  chosen  here  to  implement  the  PLA 
architecture  is  highly  compact,  leaving  very  little  unused 
real  estate  within  layout  boundaries. 

We  next  employ  an  approach  similar  to  that  above 
to  determine  Ap.  The  length  Lp  and  width  Wp  of  the  sili¬ 
con  layout  are  given  by 

Lp  =  CyZ*  4-  C2(l  l  MOD 2)  4-  <7,2“-" 

4  (3.6) 

Wp  =  Kx  m  +  KZL  2—"  *  K 3  (3.7) 

where 

tn  =  number  of  inputs  entering  upper 
decoder  (PLA) 

n  =  total  number  of  inputs 

{  =  total  number  of  outputs 


It  should  be  noted  that  the  variable  q  is  not  considered 
here  since  there  are  no  restrictions  placed  on  the  length 
of  any  p-term.  each  of  which  may  contain  up  to  n  vari¬ 
ables  in  the  Anal  output  expression.  Once  again  normal¬ 
izing  to  A  and  inserting  precise  values  for  constants  we 
obtain 

Lp  =  10.5(2")  *11(11  1  MOD  2)  4-  13(2“-") 

♦  646  A  (3.8) 

Wp  =  22m  4-  13 1  (2"*")  +  288  A  (3.9) 


Total  silicon  area  for  this  structure  is  then 

Ap  =  Lp  Wp  A2  (3. 10) 

A  particular  value  for  m  can  be  determined,  m  which 
minimizes  Ap  given  a  particular  n  and  I  For  this  design 
m  is  always  greater  than  n  /  2  resulting  in  a  fully  minim¬ 
ized  Ap  but  having  an  unacceptably  long  rectangular  sili¬ 
con  layout  A  chosen  m  in  the  range  n/2  <  m  <  m  will, 
for  most  selected  values  of  n  and  I.  result  in  a  more 
squarish  and  practical  silicon  layout  at  the  expense  of  a 
slignt  increase  in  Ap  provided  n  is  not  too  large 


Area  efficiency  comparisons  can  be  made  between 
the  dual  decoder  ROM  and  PLA  layouts  described  above 
by  examining  the  .atio  Ap/Ap.  fable  3.1  lists  several 
values  for  Ap/Ap  given  sample  values  for  n.  L  and  q  Also 
listed  are  absolute  values  for  Ap  given  in  terms  of  Az  For 
n  >  16.  Ap  was  calculated  using  7n.=n/2  for  comparative 
purposes  only,  since  practical  implementation  is  impos¬ 
sible. 

It  can  be  deduced  from  Table  3.1  that  the  ratio 
Ap /Ap  based  on  the  particular  design  given  above,  coin¬ 
cides  cioseiy  with  the  ratio  A^/A-s  derived  by  theoretical 
analysis  in  Section  II.  It  can  also  be  noted  that  for  q  >  2. 
either  Ap  becomes  too  large  for  practical  implementa¬ 
tion  or  the  ratio  Ap/Ap  favors  usage  of  the  two  decoder 
ROM.  However,  for  ?s2,  it  can  be  concluded  that  the  two 
decoder  PLA  design  implemented  above  provides  for  a 
practical  implementation  of  a  particular  class  of  logic 
expressions  wnereas  a  two  decoder  ROM  implementation 
is  either  impractical  or  less  area  efficient. 


IV.  CONCLUSION 

In  this  paper  we  have  established  the  need  for  a  new 
class  of  multi-decoder  PLA  devices  that  are  superior  to 
both  ROM  and  single  decoder  "PLA  implementations  of 
certain  logic  expressions  that  contain  many  p-terms  but 
satisfy  certain  restrictions.  Such  devices  could  have 
wider  applicability  beyond  this  ciass  of  expressions.  For 
example,  the  proposed  multi-decoder  design  couid  be 
used  for  ROM  implementations  employing  more  than  two 
decoders  based  on  the  design  technique  exemplified  in 
Section  III.  A  particular  ROM  design  using  three  decoders 
would  most  likely  result  m  some  savings  in  total  decoder 
area,  but  more  importantly,  would  result  in  even  greater 
savings  if  implemented  m  three  dimensional  microcir¬ 
cuit  topograpny.  In  this  case  the  propagation  delays 
would  be  approximately  proportional  to  This  is 

an  important  topic  for  further  research  that  can  com¬ 
bine  the  proposed  techniques  presented  in  this  work  with 
new  advances  and  current  research  in  three  dimensional 
microcircuit  devices. 
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MISSION 

of 

Rome  Air  Development  Center 

RAVC  plans  and  executes  res  earch ,  development,  test 
and  selected  acquisition  programs  -in  support  o  f 
Command,  Control,  Communications  and  Intelligence 
[C* I)  activities.  Technical  and  engineering 
support  within  areas  of  competence  is  provided  to 
ESV  Program  Offices  IPOs)  and  other  ESV  elements 
to  perform  elective  acquisition  of  C3 1  systems. 

The  areas  of  technical  competence  include 
communications ,  command  and  control,  battle 
management,  information  processing,  surveillance 
sensors,  intelligence  data  collection  and  handling, 
solid  state  sciences,  electromagnetics ,  and 
propagation,  and  electronic,  maintainability , 
and  compatibility . 


